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Description 

Field of the Invention 

5 [0001] The present invention relates to immunogenic detoxified proteins of cholera toxins (CT), or of heat labile toxins 
(LT) produced by the enterotoxigenic strains of Escherichia coli(E.coh) characterized in that the amino acid at pos Ser- 
63 is replaced. Said immunogenic detoxified protein can additionally have substitutions at one or more of amino acids 
Val-53, Val-97, Tyr-104 or Pro-106. The present invention to relates their use in vaccines which are useful for the 
prevention or treatment of cholera or enterotoxigenic E.Coli infections. The proteins can be suitably produced using 

10 recombinant DNA techniques by site-directed mutagenesis of DNA encoding the wild type toxins. 

Background to the Invention 

[0002] Cholera is a contagious disease widely distributed in the world, in particular in the Third World, where, in 
is certain areas, it is endemic. The serious disorders which develop in the intestinal system prove fatal in a high percentage 
of the recorded cases of the disease. 

[0003] The etiological agent of cholera is the Gram-negative microorganism Vibrio cholerae (V.cholerae). This col- 
onises the intestinal tract of individuals who have come into contact with it through ingestion of contaminated food or 
water, and multiplies to very high concentrations. The principal symptom is severe diarrhoea as a result of which the 
20 patient can lose as much as 10-15 litres of liquids per day via the faeces. As a result of the severe dehydration and 
loss of electrolytes, the patient does not withstand the infection in 50-60% of cases, and dies. The diarrhoea caused 
by V.cholerae is due to the secretion of cholera toxin, CT, which acts by stimulating the activity of the adenylate cyclase 
enzyme so as to induce disturbances at cell level. 

[0004] Although cholera can be effectively cured by controlled and intense rehydration, the distribution of a vaccine 

25 is desirable with a view to complete control and future eradication of the disease. 

[0005] At the present time, there exists a vaccination against cholera, consisting of parenteral administration of killed 
bacteria. Although some countries insist on vaccination against the disease, there are serious doubts as to its real 
usefulness, given that the current cellular vaccine protects against the consequences of the infection in only 50% of 
the cases and that the protection is also extremely limited in duration, to less than 6 months. 

30 [0006] In Bangladesh, an experimental trial is in progress (1990-92) of an oral vaccine consisting of killed bacteria 
with the addition of subunit B of cholera toxin, which is known to be highly immunogenic. This product succeeds in 
inducing lasting protection, without special side effects (Holmgren J., Clemens J. t Sack DA., Sanchez J. and Svenner- 
holm AM; "Oral Immunization against cholera" Curr. Top. Microbiol. Immunol. (1988), 146, 197-204). 
[0007] Cholera toxin resembles the heat labile toxins of enterotoxigenic strains of Escherichia coli in amino acid 

35 sequence, structure and mode of action. 

[0008] The consequences of infection with an enterotoxigenic strain of E.coliare similar to, though less serious than, 
those of cholera, and consist of severe diarrhoea and intestinal disorders. 

[0009] The CT and LT toxins all comprise a single A subunit (or protomer A) responsible for the enzymic activity of 
the toxin (herein CT-A or LT-A) and five identical B subunits (or protomer B) which are involved in the binding of the 
40 toxin to the intestinal epithelial cells (herein CT-B or LT-B). 

[0010] The A subunit penetrates the cell membrane and causes activation of adenylate cyclase by NAD-dependent 
ADP-ribosylation of a GTP-binding protein which controls the activity of the enzyme. The clinical effect of this is to 
cause massive fluid loss into the intestine. 

[0011] Considerable research has been conducted on cholera toxin and the E. coli heat labile toxins. 
45 [0012] The sequence of CT is known and has been described (Mekalanos J.J. et al Nature 306, page 551 (1 983)). 

[001 3] The sequence of LT from enterotoxigenic strains of E.colils, as mentioned, 80% homologous to CT and it too 

is known and described in the scientific literature. Spicer E.K. era/ (Biol. Chem. 257 p. 5716-5721 (1982)) describe 

the amino acid sequence of the A sub unit of the heat labile toxin from an enterotoxigenic strain of E. co//found in pigs. 

[0014] A bacterial chromosomal form of LT has been identified and sequenced by Pickett C.L et al (J. Bacteriol. 
50 169, 5180-5187, (1987). 

[001 5] The sequence of the A subunit of LT from a strain of E. coli known to affect humans has also been sequenced 

(Yamamoto et al, J. Biol. Chem., 259, 5037-5044, (1 984)). 

[001 6] In view of the potential clinical significance of a vaccine against cholera and enterotoxigenic bacteria there is 
a continuing and great interest in producing a detoxified toxin capable of immunising against cholera and enterotoxi- 
55 genie bacteria. The techniques of genetic engineering allow specific mutations to be introduced into the genes encoding 
the toxins and the production of the mutated toxins using now conventional techniques of gene expression and protein 
purification. 

[0017] various groups have attempted to identify mutations of the genes, which involve loss of the toxicity charac- 
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teristics of the encoded proteins. The studies are predominantly being carried out in respect of the gene for the toxin 
LT, from E coll 

[0018] Harford, S. efa/(Eur. J. Biochem. 183 , page 311 (1 989)) describe the production of a toxoid by in wtromu- 
tagenesis of the LT-A gene from E.cbli pathogenic for pigs. The resulting successful mutation contained a Ser-61 -Phe 
s substitution and a Gly-79-Lys substitution, the former being considered the more important. Harford et a/suggest that, 
because of the similarities between the LT-A genes in E.coli pathogenic to humans and pigs and the CT-A gene, and 
because the toxins are thought to operate by a common mechanism, it may be possible to produce a cholera holotoxoid 
by introducing the Ser-61 -Phe mutation into the CT-A gene. 

[0019] Tsuji, T et al (J. Biol. Chem. 265, p. 22520 (1990)) describe the mutation of the LT-A gene from plasmid 
10 EWD299 to produce a single substitution Glu-112-Lys which affects the toxicity of the mutant LT yet does not change 
the immunogenicity of the protein. 

[0020] Grant, C.C.R. et al (Abstract B289 of the 92nd General Meeting of the American Society for Microbiology, 
26-30th May 1 992) describe conservative substitutions of histidines at 44 and 70 and tryptophan at 1 27 in LT-A which 
result in significant reductions in enzymic activity. 
15 [0021] Some work has been conducted on mutations to CT 

[0022] Kaslow, H.R. et al (Abstract B291 of the 92nd General Meeting of the American Society for Microbiology, 
26-30th May 1 992) describe mutating Asp-9 and His-44 and truncating after amino acid 1 80 in CT-A which all essentially 
eliminate activity. Mutating Arg-9 is said to markedly attenuate activity. Mutating other amino acid sites had little effect 
on toxicity. 

20 [0023] Bumette, W.N. et a/(lnf. and Immun. 59(11) , 4266-4270, (1991)) describe site-specific mutagenesis of CT-A 
to produce an Arg-7-Lys mutation paralleling that of a known detoxifying mutation in the A subunit of the Bordetella 
pertussis toxin. The mutation resulted in the complete abolition of detectable AD P-ribosyltransf erase activity. 
[0024] International patent application WO 92/1 9265 (Burnette, Kaslow and Amgen Inc.) describes mutations of CT- 
A at Arg-7, Asp-9, Arg-1 1 , His-44, His-70 and Glu-1 12. 

25 [0025] Mutations at Glu-110 (LT and CT) and Arg-146 (LT) have also been described in the literature (Lobet, Inf. 
Immun., 2870, 1991; Lai, Biochem. Biophys. Res. Comm. 341 1983; Okamoto J. Bacteriol. 2208, 1988). 
[0026] The crystal structure of LT has been determined by Sixma et al (Nature, 351 , 371 -377, May 1 991 ) and confirms 
the mutatagenesis results described earlier in the literature, explaining structurally the significance of Glu-1 1 2 and Ser- 
61 in activity of the A sub unit and suggesting that His-44, Ser-1 1 4 and Arg-54 which are in the immediate neighbourhood 

30 may be important for catalysis or recognition 

Summary of the invention 

[0027] It has now been discovered by further and more detailed analysis of the structure of the toxins that certain 
35 further amino acids in the sequences of CT-A and LT-A are in positions capable of decreasing the enzymatic activity 
of CT and LT when mutated suitably, individually or in conjunction with other mutations. 

[0028] The object of the present invention is to provide a vaccine which gives total protection against cholera or 
enterotoxigenic E. coli, by means of a second generation product consisting of a single antigen, a toxoid derived from 
CT or LT, which has been detoxified genetically. 
40 [0029] The genetic detoxification of CT or LT retains the immunogenic properties of the toxoid whilst providing a 
significantly reduced and preferably absent toxicity. 

[0030] According to a first aspect of the invention, there is provided an immunogenic detoxified protein comprising 
the amino acid sequence of subunit A of cholera toxin (CT-A) or a fragment thereof or the amino acid sequence of 
subunit A of Escherichia co// heat labile toxin (LT-A) or a fragment thereof, wherein the amino acid at, or in the position 
45 corresponding to Ser-63 is replaced with another amino acid. 
Ser-63 is preferably replaced with lysine ("Ser-63-Lys"). 

[0031] The replaced amino acid is at a location in the sequences of CT-A or an LT-A which is conserved both in the 
amino acid sequence and structurally and is thus common to CT and the various LTs. 

[0032] The immunogenic detoxified protein of the invention adopts substantially the same structural conformation 
50 as the wild type naturally occuring toxins. It is immunologically active and cross reacts with antibodies to the wild type 
toxins. 

[0033] In this specification, references to CT and LT encompass the various naturally occurring strain variants as 
well other variants encompassing changes from the sequences disclosed herein which do not affect the immunogenicity 
of the assembled toxoid. 

55 [0034] In this specification, references to amino acid coordinates such as m \fa\-97 m connote the amino acid at that 
position in the sequence of the mature cholera toxin subunit A (CT-A), that is without the signal sequence (see Figure 1 ). 
[0035] Where the specification refers to an LT-A, the amino acid coordinates refer to the corresponding position in 
CT-A as shown in Figure 1 . 
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[0036] . Thus, for example, val-53 in CT corresponds to Val-52 in the LT1 subunit and Ser-63 in CT corresponds to 
Ser-62 in LT1 , there being a single amino acid difference in numbering up to amino acid 89 of the LT1 sequence. val- 
97 in the CT sequence corresponds to \fel-93 in the LT1 sequence because of the four amino acid difference at that 
point in the sequence. 

5 [0037] In addition, the immunogenic detoxified protein of the invention may include other mutations such as, for 
example, substitutions at one or more of Arg-7, Asp-9, Arg-1 1 , His-44, val-53, Arg-54, Ser-61 , His-70, Val-97, Tyr-1 04, 
Pro-106, His-107, Glu-110, Glu-112, Ser-114, Trp-127, Arg-146 or Arg-192. 

[0038] The amino acid substituted for the wild type amino acid may be a naturally occurring amino acid or may a 
modified or synthetic amino acid. The substitution may involve deletion of an amino acid altogether provided that the 
10 mutant retains the necessary immunogenic properties and exhibits a substantially reduced toxicity. 

[0039] Substitutions which alter the amphotericity and hydrophilicity whilst retaining the steric effect of the substituting 
amino acid as far as possible are generally preferred. 

[0040] Preferred substitutions include: Val-53-Asp, Val-53-Glu, Val-53-Tyr, val-97-Lys, Val-97-Tyr, His-107-Glu, Tyr- 
104-Lys, Tyr-1 04-Asp, Tyr-1 04-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

15 [0041] As used herein, the term "detoxified" means that the immunogenic composition exhibits a substantially lower 
toxicity relative to its naturally occurring toxin counterpart. The substantially lower toxicity should be sufficiently low for 
the protein to be used in an immunogenic composition in an immunologically effective amount as a vaccine with causing 
significant side effects. For example, the immunogenic detoxified protein should have a toxicity of less than 0.01% of 
the naturally occurring toxin counterpart. The toxicity may be measured in mouse CHO cells or preferably by evaluation 

20 of the morphological changes induced in Y1 cells. The term "toxoid" means a genetically detoxified toxin. 

[0042] The immunogenic protein may be a CT or LT subunit A toxoid, but is preferably an assembled toxin molecule 
comprising a mutated CT-A or LT-A subunit and five B subunits of CT or LT. The B subunit may be a naturally occurring 
subunit or may itself be mutated. 

[0043] The immunogenic protein is preferably a naturally occurring CT-A or an LT-A suitably modified as described 
25 above. However, conservative amino acid changes may be made which do not affect the immunogenicity or the toxicity 
of immunogenic protein and preferably do not affect the ability of the immunogenic protein to form complete toxin with 
B subunit protein. Also, the immunogenic protein may be a fragment of CT-A or an LT-A provided that the fragment is 
immunogenic and non toxic and contains at least one of the conserved regions containing one of the mutations ac- 
cording to the invention. 

30 [0044] According to a second aspect of the invention, there is provided an immunogenic composition for use as a 
vaccine comprising an immunogenic detoxified protein of the first aspect of the invention and a pharmaceutically ac- 
ceptable carrier. 

[0045] The immunogenic composition may additionally contain one or more adjuvants and/or pharmaceutically ac- 
ceptable diluents. 

35 [0046] The invention also provides a vaccine composition comprising an immunogenic detoxified protein according 
to the first aspect of the invention and a pharmaceutically acceptable carrier. The vaccine composition may further 
comprise an adjuvant. 

[0047] According to a third aspect of the invention, there is provided a method of vaccinating a mammal against 
Vibrio cholerae or an enterotoxigenic strain of Escherichia coli comprising administering an immunologically effective 
40 amount of an immunogenic detoxified protein according to the first aspect of the invention. 

[0048] The immunogenic detoxified proteins of the invention may be synthesised chemically using conventional pep- 
tide synthesis techniques, but are preferably produced by recombinant DNA means. 

[0049] According to a fourth aspect of the invention there is provided a DNA sequence encoding an immunogenic 
detoxified protein according to the first aspect of the invention. 
^5 [0050] Preferably the DNA sequence contains a DNA sequence encoding a complete CT or LT comprising DNA 
encoding both the detoxified subunit A and subunit B in a polycistronic unit. Alternatively, the DNA may encode only 
the detoxified subunit A. 

[0051] According to a fifth aspect of the invention, there is provided a vector carrying a DNA according to the fourth 
aspect of the invention. 

50 [0052] According to a sixth aspect of the invention, there is provided a host cell line transformed with the vector 
according to the fifth aspect of the invention. 

[0053] The host cell may be any host capable of producing CT or LT but is preferably a bacterium, most suitably E. 
coli or V.choterae suitable engineered to produce the desired immunogenic detoxified protein. 
[0054] In a further embodiment of the sixth aspect of the invention, the host cell may itself provide a protective 
55 species, for example an E.cotiov V.choterae strain mutated to a phenotype lacking wild type LT or CT and carrying 
and expressing an immunogenic detoxified protein of the first aspect of the invention. 

[0055] In a further embodiment of the sixth aspect of the invention the host cell is capable of expressing a chromo- 
somal LT-A gene according to the first aspect of the invention. 
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[0056] According to a seventh aspect of the invention, there is provided a process for the production of an immuno- 
genic detoxified protein according to the first aspect of the invention comprising culturing a host cell according to the 
sixth aspect of the invention. 

[0057] According to a eighth aspect of the invention there is provided a process for the production of DNA according 
s to the fourth aspect of the invention comprising the steps of subjecting a DNA encoding a CT-A or an LT-A or a fragment 
thereof to site-directed mutagenesis. 

[0058] According to a ninth aspect of the invention there is provided a process for the formulation of a vaccine 
comprising bringing an immunogenic detoxified protein according to the first aspect of the invention into association 
with a pharmaceutical ly acceptable carrier and optionally with an adjuvant. 

10 

Industrial Applicability 

[0059] The immunogenic detoxified protein of the invention constitutes the active component of a vaccine composi- 
tion useful for the prevention and treatment of cholera infections or infections by enterotoxigenic strains of E.colL The 
is compositions are thus applicable for use in the pharmaceutical industry. 

Brief Description of the Drawings 

[0060] Figure 1 shows the amino acid sequences of the wild type subunrt A from: 

20 

i) cholera toxin (CT - Mekalanos et a/ op cit), 

ii) heat labile toxin from an E.cofi strain found in man (LT1_1 - Yamamoto et al op cit) 

iii) heat labile toxin from an E.coli strain found in pigs (LT1- Spicer et a/ op cit), and 

iv) heat labile toxin from a chromosomal source (LT1_1 - Pickett et a/ op cit) 

25 

[0061] The signal sequences are not shown. 

[0062] In Figure 1 , the conventional single letter amino acid code is used. The symbol V denotes an absent amino 
acid and acts as a typographical spacer to ensure that the sequences remain in alignment for ease of comparison. 
The symbol indicates an amino acid in the sequences of LT1 and LT2 which is identical to the corresponding amino 
30 acid in CT. The numbers against each line are the amino acid number of the first amino acid on that line. 
[0063] In Figure 1 the positions of the mutations of the present invention are shown underlined. 
[0064] Figures 2a and 2b are comparisons of the amino acid and DNA sequences of the A sub units of LT1 and CT. 
[0065] Figure 3 is a restriction map of plasmid EWD299 (Dallas et al), bearing the LT-A gene. 

35 Detailed Description of Embodiments of the Invention 

[0066] The practice of the present invention will employ, unless otherwise indicated, conventional techniques of 
molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such tech- 
niques are explained fully in the literature. See e.g., Sambrook, et al., MOLECULAR CLONING; A LABORATORY 

40 MANUAL, SECOND EDITION (1989); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 1985); OLIGONUCLE- 
OTIDE SYNTHESIS (M.J. Gait ed, 1984); NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds. 1984); 
TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins eds. 1 984); ANIMAL CELL CULTURE (R.I. Fresh- 
ney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MO- 
LECULAR CLONING (1984); the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); GENE TRANSFER 

45 VECTORS FOR MAMMALIAN CELLS (J.H. Miller and M.P Calos eds. 1 987, Cold Spring Harbor Laboratory), Methods 
in Enzymotogy Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., respectively), Mayer and Walker, eds. (1987), 
IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY (Academic Press, London), Scopes, (1987), 
PROTEIN PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition (Springer-Verlag, N.Y.), and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY, VOLUMES l-IV (D M. Weir and C. C. Blackwell eds 1986). 

so [0067] Standard abbreviations for nucleotides and amino acids are used in this specification. All publications .patents, 
and patent applications cited herein are incorporated by reference. 
[0068] In particular, the following amino acid abbreviations are used: 





Alanine 


A 


Ala 


55 


Arginine 


R 


Arg ! 




Asparagine 


N 


Asn 




Aspartic Acid 


D 


Asp 
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(continued) 



10 



15 



20 



Cysteine 


C 


j Cys 


Glycine 


G 


Gry 


Glutamic Acid 


E 


Glu 


Glutamine 


Q 


Gin 


Histidine 


H 


His 


Isoleucine 


I 


lie 


Leucine 


L 


Leu 


Lysine 


K 


Lys 


Methionine 


M 


Met j 


Phenylalanine 


F 


Phe 


Proline 


P 


Pro 


Serine 


S 


Ser 


Threonine 


T 


Thr 


Tryptophan 


W 


Trp 


Tyrosine 


Y 


Tyr 


Valine 


V 


Val 
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30 



35 



40 



45 



50 



55 



[0069] As mentioned above examples of the immunogenic detoxified protein that can be used in the present invention 
include polypeptides with minor amino acid variations from the natural amino acid sequence of the protein other than 
at the sites of mutation specifically mentioned. 

[0070] A significant advantage of producing the immunogenic detoxified protein by recombinant DNA techniques 
rather than by isolating and purifying a protein from natural sources is that equivalent quantities of the protein can be 
produced by using less starting material than would be required for isolating the protein from a natural source. Producing 
the protein by recombinant techniques also permits the protein to be isolated in the absence of some molecules normally 
present in cells. Indeed, protein compositions entirely free of any trace of human protein contaminants can readily be 
produced because the only human protein produced by the recombinant non-human host is the recombinant protein 
at issue. Potential viral agents from natural sources and viral components pathogenic to humans are also avoided. 
Also, genetically detoxified toxin are less likely to revert to a toxic from than more traditional, chemically detoxified toxins. 
[0071] Pharmaceutical ly acceptable carriers include any carrier that does not itself induce the production of antibod- 
ies harmful to the individual receiving the composition. Suitable carriers are typically large, slowly metabolized mac- 
romolecules such as proteins, polysaccharides, polylactic acids, polyglycolic acids, polymeric amino acids, amino acid 
copolymers, lipid aggregates (such as oil droplets or liposomes) and inactive virus particles. Such carriers are well 
known to those of ordinary skill in the art. Additionally, these carriers may function as immunostimulating agents (ad- 
juvants). 

[0072] Preferred adjuvants to enhance effectiveness of the composition include, but are not limited to: aluminum 
salts (alum) such as aluminium hydroxide, aluminium phosphate, aluminium sulfate etc., oil emulsion formulations, 
with or without other specific immunostimulating agents such as muramyl peptides or bacterial cell wall components, 
such as for example (1) MF59 (Published International patent application WO-A-90/14837, containing 5% Squalene, 
0.5% Tween® 80, 0.5% Span® 85 (optionally containing various amounts of MTP-PE (see below), although not re- 
quired) formulated into submicron particles using a microfluidizer such as Model 110Y microfluidizer (Microti uidics, 
Newton, MA 02164), (2) SAF, containing 10% squalene, 0.4% Tween 80, 5% pluronic-blocked polymer L1 21 , and thr- 
MDP (see below) either microfluidized into a submicron emulsion or vortexedto generate a larger particle size emulsion, 
and (3) RIBI' adjuvant system (RAS) (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 0.2% TweerKB>80 
and one or more bacterial cell wall components from the group consisting of monophosphoryl lipid A (MPL), trehalose 
dimycolate (TDM), and cell wall skeleton (CWS) preferably MPL+CWS (Detox™), muramyl peptides such as N-acetyl- 
muramyl-L-threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-iso-glutamine (nor-MDP), N-acetyl- 
muramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2 , -dipalmrtoyl-srhglycero-3-hydroxyphosphoryloxy)-ethylamine 
(MTP-PE) etc., and cytokines, such as interleukins (IL-1, IL-2 etc) macrophage colony stimulating factor (M-CSF), 
tumour necrosis factor (TNF) etc. Additionally, saponin adjuvants, such as Stimulon™ (Cambridge Bioscience, Worces- 
ter, MA) may be used or particles generated therefrom such as ISCOMS (immunostimulating complexes). Furthermore, 
Complete Freunds Adjuvant (CFA) and Incomplete Freunds Adjuvant (I FA) may be used. Alum and MF59 are preferred. 
[0073] The immunogenic compositions (e.g. the antigen, pharmaceutical^ acceptable carrier and adjuvant) typically 
will contain diluents, such as water, saline, glycerol, ethanol, etc. Additionally, auxiliary substances, such as wetting 
or emulsifying agents, pH buffering substances, and the like, may be present in such vehicles. 
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[0074] Typically, the immunogenic compositions are prepared as injectables, either as liquid solutions or suspen- 
sions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection may also be prepared. The 
preparation also may be emulsified or encapsulated in liposomes for enhanced adjuvant effect as discussed above 
under pharmaceutically acceptable carriers. 

s [0075] Immunogenic compositions used as vaccines comprise an immunologically effective amount of the antigenic 
polypeptides, as well as any other of the above-mentioned components, as needed. By 'immunologically effective 
amount", it is meant that the administration of that amount to an individual, either in a single dose or as part of a series, 
is effective for treatment or prevention. This amount varies depending upon the health and physical condition of the 
individual to be treated, the taxonomic group of individual to be treated (e.g., nonhuman primate, primate, etc.), the 

10 capacity of the individual's immune system to synthesize antibodies, the degree of protection desired, the formulation 
of the vaccine, the treating doctor's assessment of the medical situation, and other relevant factors. It is expected that 
the amount will fall in a relatively broad range that can be determined through routine trials. 

[0076] The immunogenic compositions are conventionally administered parenterally, e.g. by injection either subcu- 
taneously or intramuscularly. Additional formulations suitable for other modes of administration include oral and pul- 

15 monary formulations, suppositories and transdermal applications. Dosage treatment may be a single dose schedule 
or a multiple dose schedule. The vaccine may be administered in conjunction with other immunoregulatory agents. 
[0077] The term "recombinant polynucleotide" as used herein intends a polynucleotide of genomic, cDNA, semisyn- 
thetic, or synthetic origin which, by virtue of its origin or manipulation: (1) is not associated with all or a portion of a 
polynucleotide with which it is associated in nature, (2) is linked to a polynucleotide other than that to which it is linked 

20 jn nature, or (3) does not occur in nature. 

[0078] The term "polynucleotide" as used herein refers to a polymeric form of nucleotides of any length, either ribo- 
nucleotides or deoxyribonucleotkJes. This term refers only to the primary structure of the molecule. Thus, this term 
includes double- and single-stranded DN A and RN A. It also includes known types of modifications, for example, labels 
which are known in the art, methylation, "caps", substitution of one or more of the naturally occurring nucleotides with 

25 an analog, internucleotide modifications such as, for example, those with uncharged linkages (e.g., methyl phospho- 
nates, phosphotriesters, phosphoamidates, carbamates, etc.) and with charged linkages (e.g., phosphoroth bates, 
phosphorodithioates, etc.), those containing pendant moieties, such as, for example proteins (including for e.g., nu- 
cleases, toxins, antibodies, signal peptides, pory-L-lysine, etc.), those with intercalators (e.g., acndine, psoralen, etc.), 
those containing chelators (e.g., metals, radioactive metals, boron, oxidative metals, etc.), those containing alkylators, 

30 those with modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of the polynucle- 
otide. 

[0079] A "replicon" is any genetic element, e.g., a plasmid, a chromosome, a virus, a cosmid, etc. that behaves as 
an autonomous unit of polynucleotide replication within a cell; i.e., capable of replication under its own control. This 
may include selectable markers. 
35 [0080] A "vector" is a replicon in which another polynucleotide segment is attached, so as to bring about the replication 
and/or expression of the attached segment. 

[0081 ] "Control sequence" refers to polynucleotide sequences which are necessary to effect the expression of coding 
sequences to which they are ligated. The nature of such control sequences differs depending upon the host organism; 
in prokaryotes, such control sequences generally include promoter, ribosomal binding site, and transcription termination 
40 sequence; in eu karyotes, generally, such control sequences include promoters and transcription termination sequence. 
The term "control sequences" is intended to include, at a minimum, all components whose presence is necessary for 
expression, and may also include additional components whose presence is advantageous, for example, leader se- 
quences and fusion partner sequences. 

[0082] "Operably linked" refers to a juxtaposition wherein the components so described are in a relationship permit- 
45 ting them to function in their intended manner. A control sequence "operably linked" to a coding sequence is ligated in 
such a way that expression of the coding sequence is achieved under conditions compatible with the control sequences. 
[0083] An "open reading frame" (ORF) is a region of a polynucleotide sequence which encodes a polypeptide; this 
region may represent a portion of a coding sequence or a total coding sequence. 

[0084] A "coding sequence" is a polynucleotide sequence which is translated into a polypeptide, usually via mRNA, 
50 when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are de- 
termined by a translation start codon at the 5'-terminus and a translation stop codon at the 3'-tenminus. A coding 
sequence can include, but is not limited to, cDNA, and recombinant polynucleotide sequences. 
[0085] "PCR" refers to the technique of polymerase chain reaction as described in Saiki, et aL, Nature 324:163 
(1986); and Scharf et aL, Science (1986) 233:1076-1078; and U.S. 4,683,195; and U.S. 4,683,202. 
55 [0086] As used herein, x is "heterologous" with respect to y if x is not naturally associated with y in the identical 
manner; i.e., x is not associated with y in nature or x is not associated with y in the same manner as is found in nature. 
[0087] "Homology" refers to the degree of similarity between x and y. The correspondence between the sequence 
from one form to another can be determined by techniques known in the art. For example, they can be determined by 



7 



EP 0 620 850 B1 

a direct comparison of the sequence information of the polynucleotide. Alternatively, homology can be determined by 
hybridization of the polynucleotides under conditions which form stable duplexes between homologous regions (for 
example, those which would be used prior to S 1 digestion), followed by digestion with single-stranded specific nuclease 
(s), followed by size determination of the digested fragments. 

s [0088] As used herein, the term 'polypeptide' refers to a polymer of amino acids and does not refer to a specific 
length of the product; thus, peptides, oligopeptides, and proteins are included within the definition of polypeptide. This 
term also does not refer to or exclude post expression modifications of the polypeptide, for example, glycosylations, 
acetylations, phosphorylations and the like. Included within the definition are, for example, polypeptides containing 
one or more analogs of an amino acid (including, for example, unnatural amino acids, etc.), polypeptides with substi- 

10 tuted linkages, as well as other modifications known in the art, both naturally occurring and non-naturally occurring. 
[0089] A polypeptide or amino acid sequence 'derived from" a designated nucleic acid sequence refers to a polypep- 
tide having an amino acid sequence identical to that of a polypeptide encoded in the sequence, or a portion thereof 
wherein the portion consists of at least 3-5 amino acids, and more preferably at least 8-10 amino acids, and even more 
preferably at least 11-15 amino acids, or which is immunologically identifiable with a polypeptide encoded in the se- 

is quence. This terminology also includes a polypeptide expressed from a designated nucleic acid sequence. 

[0090] The protein may be used for producing antibodies, either monoclonal or polyclonal, specific to the protein. 
The methods for producing these antibodies are known in the art. 

[0091] 'Recombinant host cells', 'host cells," "cells," 'cell cultures,' and other such terms denote, for example, mi- 
croorganisms, insect cells, and mammalian cells, that can be, or have been, used as recipients for recombinant vector 
20 or other transfer DNA, and include the progeny of the original cell which has been transformed. It is understood that 
the progeny of a single parental cell may not necessarily be completely identical in morphology or in genomic or total 
DNA complement as the original parent, due to natural, accidental, or deliberate mutation. Examples for mammalian 
host cells include Chinese hamster ovary (CHO) and monkey kidney (COS) cells. 

[0092] Specifically, as used herein, "cell line,' refers to a population of cells capable of continuous or prolonged 
2S growth and division in vitro. Often, cell lines are clonal populations derived from a single progenitor cell. It is further 
known in the art that spontaneous or induced changes can occur in karyotype during storage or transfer of such clonal 
populations. Therefore, cells derived from the cell line referred to may not be precisely identical to the ancestral cells 
or cultures, and the cell line referred to includes such variants. The term "cell lines" also includes immortalized cells. 
Preferably, cell lines include nonhybrid cell lines or hybridomas to only two cell types. 
30 [0093] As used herein, the term "microorganism" includes prokaryotic and eukaryotic microbial species such as 
bacteria and fungi, the latter including yeast and filamentous fungi. 

[0094] "Transformation', as used herein, refers to the insertion of an exogenous polynucleotide into a host cell, 
irrespective of the method used for the insertion, for example, direct uptake, transduction, f-mating or electroporation. 
The exogenous polynucleotide may be maintained as a non-integrated vector, for example, a plasmid, or alternatively, 

35 may be integrated into the host genome. 

[0095] By "genomic" is meant a collection or library of DNA molecules which are derived from restriction fragments 
that have been cloned in vectors. This may include all or part of the genetic material of an organism. 
[0096] By "cDNA" is meant a complementary DNA sequence that hybridizes to a complementary strand of DNA. 
[0097] By "purified" and "isolated" is meant, when referring to a polypeptide or nucleotide sequence, that the indicated 

40 molecule is present in the substantial absence of other biological macromolecules of the same type. The term "purified" 
as used herein preferably means at least 75% by weight, more preferably at least 85% by weight, more preferably still 
at least 95% by weight, and most preferably at least 98% by weight, of biological macromolecules of the same type 
present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000, can be present). 

4£ [0098] Once the appropriate coding sequence is isolated, it can be expressed in a variety of different expression 
systems; for example those used with mammalian cells, baculoviruses, bacteria, and yeast. 

i. Mammalian Systems 

so [0099] Mammalian expression systems are known in the art. A mammalian promoter is any DNA sequence capable 
of binding mammalian RNA polymerase and initiating the downstream (3*) transcription of a coding sequence (e.g. 
structural gene) into mRNA. A promoter will have a transcription initiating region, which is usually placed proximal to 
the 5' end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream of the transcription 
initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA synthesis at the correct site. A 

55 mammalian promoter will also contain an upstream promoter element, usually located within 100 to 200 bp upstream 
of the TATA box. An upstream promoter element determines the rate at which transcription is initiated and can act in 
either orientation [Sambrook et al. (1989) "Expression of Cloned Genes in Mammalian Cells." In Molecular Cloning: A 
Laboratory Manual, 2nd ed.]. 
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[0100] Mammalian viral genes are often highly expressed and have a broad host range; therefore sequences en- 
coding mammalian viral genes provide particularty useful promoter sequences. Examples include the SV40 early pro- 
moter, mouse mammary tumor virus LTR promoter, adenovirus major late promoter (Ad MLP), and herpes simplex 
virus promoter. In addition, sequences derived from non-viral genes, such as the murine metallotheionein gene, also 
5 provide useful promoter sequences. Expression may be either constitutive or regulated (inducible), depending on the 
promoter can be induced with glucocorticoid in hormone-responsive cells. 

[0101] The presence of an enhancer element (enhancer), combined with the promoter elements described above, 
will usually increase expression levels. An enhancer is a regulatory DNA sequence that can stimulate transcription up 
to 1000-fold when linked to homologous or heterologous promoters, with synthesis beginning at the normal RNA start 

10 site. Enhancers are also active when they are placed upstream or downstream from the transcription initiation site, in 
either normal or flipped orientation, or at a distance of more than 1000 nucleotides from the promoter [Maniatis et al. 
(1987) Science 236 :1237; Alberts et al. (1989) Molecular Biology of the Cell, 2nd ed.]. Enhancer elements derived 
from viruses may be particularly useful, because they usually have a broader host range. Examples include the SV40 
early gene enhancer [Dijkema et al (1 985) EMBO J. 4 :7611 and the enhancer/promoters derived from the long terminal 

15 repeat (LTR) of the Rous Sarcoma Virus [Gorman et al. (1982b) Proc. Natl. Acad. Sci. 79 :6777] and from human 
cytomegalovirus [Boshart etal. (1985) Cell 41 :5211. Additionally, some enhancers are regulatable and become active 
only in the presence of an inducer, such as a hormone or metal ion [Sassone-Corsi and Borelli (1 986) Trends Genet. 
2:215; Maniatis et al. (1987) Science 236:1237]. 

[0102] A DNA molecule may be expressed intracellularly in mammalian cells. A promoter sequence may be directly 
20 linked with the DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will 
always be a methionine, which is encoded by the ATG start codon. If desired, the N-terminus may be cleaved from the 
protein by in vitro incubation with cyanogen bromide. 

[0103] Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric 
DNA molecules that encode a fusion protein comprised of a leader sequence fragment that provides for secretion of 
25 the foreign protein in mammalian cells. Preferably, there are processing sites encoded between the leader fragment 
and the foreign gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a 
signal peptide comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. The ade- 
novirus triparrte leader is an example of a leader sequence that provides for secretion of a foreign protein in mammalian 
cells. 

30 [0104] Usually, transcription termination and polyadenylation sequences recognized by mammalian cells are regu- 
latory regions located 3' to the translation stop codon and thus, together with the promoter elements, flank the coding 
sequence. The 3' terminus of the mature mRNA is formed by site-specific post-transcriptional cleavage and polyade- 
nylation [Birnstiel et al. (1985) Cell 41 :349; Proudfoot and Whitelaw (1988) "Termination and 3' end processing of 
eukaryotic RNA. In Transcription and splicing (ed. B.D. Hames and D.M. Glover); Proudfoot M989) Trends Biochem. 

35 Sci. 14 :105], These sequences direct the transcription of an mRNA which can be translated into the polypeptide en- 
coded by the DNA. Examples of transcription terminater/polyadenylation signals include those derived from SV40 
[Sambrook et al (1 989) ■Expression of cloned genes in cultured mammalian cells/ In Molecular Cloning: A Laboratory 
Manual]. 

[01 05] Some genes may be expressed more efficiently when introns (also called intervening sequences) are present. 

40 Several cDNAs, however, have been efficiently expressed from vectors that lack splicing signals (also called splice 
donor and acceptor sites) [see e.g., Gothing and Sambrook (1981) Nature 293 : 620]. Introns are intervening noncoding 
sequences within a coding sequence that contain splice donor and acceptor sites. They are removed by a process 
called "splicing,' following polyadenylation of the primary transcript [Nevins (1983) Annu. Rev. Biochem. 52 :441 ; Green 
(1986) Annu. Rev. Genet. 20 :671; Padgett et al. (1986) Annu. Rev. Biochem. 55 :1119; Krainer and Maniatis (1988) 

45 "RNA splicing." In Transcription and splicing (ed. B.D. Hames and D.M. Glover)]. 

[0106] Usually, the above described components, comprising a promoter, polyadenylation signal, and transcription 
termination sequence are put together into expression constructs. Enhancers, introns with functional splice donor and 
acceptor sites, and leader sequences may also be included in an expression construct, if desired. Expression constructs 
are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable mainte- 

50 nance in a host, such as mammalian cells or bacteria. Mammalian replication systems include those derived from 
animal viruses, which require trans-acting factors to replicate. For example, plasmids containing the replication systems 
of papovaviruses, such as SV40 [Gluzman (1981) Cell 23 :175] or polyomavirus, replicate to extremely high copy 
number in the presence of the appropriate viral T antigen. Additional examples of mammalian replicons include those 
derived from bovine papillomavirus and Epstein-Barr virus. Additionally, the replicon may have two replication systems, 

55 thus allowing it to be maintained, for example, in mammalian cells for expression and in a procaryotic host for cloning 
and amplification. Examples of such mammalian-bacteria shuttle vectors include pMT2 [Kaufman et al. (1989) Mol. 
Cell. Biol. 9 :946 and pHEBO [Shimizu et al. (1986) Mol. Cell. Biol. 6 :10741. 

[0107] The transformation procedure used depends upon the host to be transformed. Methods for introduction of 
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heterologous polynucleotides into mammalian cells are known in the art and include dextran-mediated transfection, 
calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, encapsulation of 
the polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

[0108] Mammalian cell lines available as hosts for expression are known in the art and include many immortalized 
s cell lines available from the American Type Culture Collection (ATCC), including but not limited to, Chinese hamster 
ovary (CHO) cells, HeLa cells, baby hamster kidney (BHK) cells, monkey kidney cells (COS), human hepatocellular 
carcinoma cells (e.g., Hep G2), and a number of other cell lines. 

ii. Baculovirus Systems 

10 

[0109] The polynucleotide encoding the protein can also be inserted into a suitable insect expression vector, and is 
operably linked to the control elements within that vector. Vector construction employs techniques which are known in 
the art. 

[0110] Generally, the components of the expression system include a transfer vector, usually a bacterial plasmid, 
is which contains both a fragment of the baculovirus genome, and a convenient restriction site for insertion of the heter- 
ologous gene or genes to be expressed; a wild type baculovirus with a sequence homologous to the baculovirus- 
specific fragment in the transfer vector (this allows for the homologous recombination of the heterologous gene in to 
the baculovirus genome); and appropriate insect host cells and growth media. 

[0111] After inserting the DNA sequence encoding the protein into the transfer vector, the vector and the wild type 
20 viral genome are transfected into an insect host cell where the vector and viral genome are allowed to recombine. The 
packaged recombinant virus is expressed and recombinant plaques are identified and purified. Materials and methods 
for bacubvirus/insect cell expression systems are commercially available in kit form from, inter alia. Invitrogen, San 
Diego CA ("MaxBac" kit). These techniques are generally known to those skilled in the art and fully described in Sum- 
mers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1 987) (hereinafter "Summers and Smith"). 
25 [0112] Prior to inserting the DNA sequence encoding the protein into the baculovirus genome, the above described 
components, comprising a promoter, leader (if desired), coding sequence of interest, and transcription termination 
sequence, are usually assembled into an intermediate transplacement construct (transfer vector). Tbis construct may 
contain a single gene and operably linked regulatory elements; multiple genes, each with its owned set of operably 
linked regulatory elements; or multiple genes, regulated by the same set of regulatory elements. Intermediate trans- 
30 placement constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) ca- 
pable of stable maintenance in a host, such as a bacterium. The replicon will have a replication system, thus allowing 
it to be maintained in a suitable host for cloning and amplification. 

[011 3] Currently, the most commonly used transfer vector for introducing foreign genes into AcNP V is pAc373. Many 
other vectors, known to those of skill in the art, have also been designed. These include, for example, pVL985 (which 
35 alters the polyhedrin start codon from ATG to ATT, and which introduces a BamHI cloning site 32 base pairs downstream 
from the ATT; see Luckow and Summers, Virology (1 989) 17: 31 . 

[011 4] The plasmid usually also contains the polyhedrin polyadenylation signal (Miller et al. (1 988) Ann. Rev. Micro- 
biol., 42 :177) and a procaryotic ampicillin-resistance (amp) gene and origin of replication for selection and propagation 
in E. coli. 

40 [0115] Baculovirus transfer vectors usually contain a baculovirus promoter. A baculovirus promoter is any DNA se- 
quence capable of binding a baculovirus RNA polymerase and initiating the downstream (5' to 3') transcription of a 
coding sequence (e.g. structural gene) into mRN A. A promoter will have a transcription initiation region which is usually 
placed proximal to the 5* end of the coding sequence. This transcription initiation region usually includes an RNA 
polymerase binding site and a transcription initiation site. A baculovirus transfer vector may also have a second domain 

45 called an enhancer, which, if present, is usually distal to the structural gene. Expression may be either regulated or 
constitutive. 

[0116] Structural genes, abundantly transcribed at late times in a viral infection cycle, provide particularly useful 
promoter sequences. Examples include sequences derived from the gene encoding the viral polyhedron protein, 
Friesen et al., (1986) The Regulation of Baculovirus Gene Expression," in: The Molecular Biology of Baculoviruses 
50 (ed. Walter Doerfler); EPO Publ. Nos. 1 27 839 and 1 55 476; and the gene encoding the p1 0 protein, Vlak et al., (1 988), 
J. Gen. Virol. 69 :765. 

[0117] DNA encoding suitable signal sequences can be derived from genes for secreted insect or baculovirus pro- 
teins, such as the baculovirus polyhedrin gene (Carbonell et al. (1988) Gene, 73:409). Alternatively, since the signals 
for mammalian cell posttranslational modifications (such as signal peptide cleavage, proteolytic cleavage, and phos- 
55 phorylation) appear to be recognized by insect cells, and the signals required for secretion and nuclear accumulation 
also appear to be conserved between the invertebrate cells and vertebrate cells, leaders of non-insect origin, such as 
those derived from genes encoding human a-interferon, Maeda etal., (1 985), Nature 31 5 :592; human gastrin-releasing 
peptide, Lebacq-Verheyden et al., (1988), Molec. Cell. Biol. 8:3129: human IL-2, Smith etal., (1985) Proc. Nat'l Acad. 
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Sci. USA, 82:B404; mouse IL-3, (Miyajima et aL, (1987) Gene 58:273; and human glucocerebrosidase, Martin et al. 

(1 988) DNA, 7:99, can also be used to provide lor secretion in insects. 

[011 8] A recombinant polypeptide or polyprotein may be expressed intracellular^ or, if it is expressed with the proper 
regulatory sequences, it can be secreted. Good intracellular expression of nonfused foreign proteins usually requires 
s heterologous genes that ideally have a short leader sequence containing suitable translation initiation signals preceding 
an ATG start signal. If desired, methionine at the N-terminus may be cleaved from the mature protein by in vitro incu- 
bation with cyanogen bromide. 

[0119] Alternatively, recombinant polyproteins or proteins which are not naturally secreted can be secreted from the 
insect cell by creating chimeric DNA molecules that encode a fusion protein comprised of a leader sequence fragment 
10 that provides for secretion of the foreign protein in insects. The leader sequence fragment usually encodes a signal 
peptide comprised of hydrophobic amino acids which direct the translocation of the protein into the endoplasmic retic- 
ulum. 

[01 20] After insertion of the DNA sequence and/or the gene encoding the expression product precursor of the protein, 
an insect cell host is co-transformed with the heterologous DNA of the transfer vector and the genomic DNA of wild 
is type baculovirus ~ usually by co-transfection. The promoter and transcription termination sequence of the construct 
will usually comprise a 2-5 kb section of the baculovirus genome. 

[0121] Methods for introducing heterologous DNA into the desired site in the baculovirus virus are known in the art. 
(See Summers and Smith supra ; Ju et al. (1 987); Smith et al. t Mol. Cell. Biol. (1 983) 3:21 56; and Luckow and Summers 

(1 989) ). For example, the insertion can be into a gene such as the polyhedrin gene, by homologous double crossover 
20 recombination; insertion can also be into a restriction enzyme site engineered into the desired baculovirus gene. Miller 

et al., (1989), Bioessays 4:91 .The DNA sequence, when cloned in place of the polyhedrin gene in the expression 
vector, is flanked both 5' and 3 1 by polyhedrin-specrfic sequences and is positioned downstream of the polyhedrin 
promoter. 

[0122] The newly formed baculovirus expression vector is subsequently packaged into an infectious recombinant 
25 baculovirus. Homologous recombination occurs at tow frequency (between about 1 % and about 5%); thus, the majority 
of the virus produced after cotransfection is still wild-type virus. Therefore, a method is necessary to identify recombinant 
viruses. An advantage of the expression system is a visual screen allowing recombinant viruses to be distinguished. 
The polyhedrin protein, which is produced by the native virus, is produced at very high levels in the nuclei of infected 
cells at late times after viral infection. Accumulated polyhedrin protein forms occlusion bodies that also contain em- 
30 bedded particles. These occlusion bodies, up to 15 um in size, are highly retractile, giving them a bright shiny appear- 
ance that is readily visualized under the light microscope. Cells infected with recombinant viruses lack occlusion bodies. 
To distinguish recombinant virus from wild-type virus, the transfection supernatant is plaqued onto a monolayer of 
insect cells by techniques known to those skilled in the art. Namely, the plaques are screened under the light microscope 
for the presence (indicative of wild-type virus) or absence (indicative of recombinant virus) of occlusion bodies. "Current 
35 Protocols in Microbiology' Vol. 2 (Ausubel et al. eds) at 16.8 (Supp. 10, 1990); Summers and Smith, supra ; Miller et 
al. (1989). 

[0123] Recombinant baculovirus expression vectors have been developed for infection into several insect cells. For 
example, recombinant baculoviruses have been developed for, inter alia : Aedes aegypti , Autographa califomica , Bom- 
byx mori , Drosophila melanogaster , Spodoptera frugiperda, and Trtchoplusia ni (PCT Pub. No. WO 89/046699; Car- 
40 bonell etal., (1985) J. Virol. 56 :153; Wright (1986) Nature 321 :718; Smith etal., (1983) Mol. Cell. Biol. 3:2156; and 
see generally, Fraser, et al. (1989) In Vitro Cell. Dev. Biol. 25 :225). 

[0124] Cells and cell culture media are commercially available for both direct and fusion expression of heterologous 
polypeptides in a baculovir us/expression system; cell culture technology is generally known to those skilled in the art. 
See, e.g. , Summers and Smith supra . 

45 [0125] The modified insect cells may then be grown in an appropriate nutrient medium, which allows for stable main- 
tenance of the plasmid(s) present in the modified insect host. Where the expression product gene is under inducible 
control, the host may be grown to high density, and expression induced. Alternatively, where expression is constitutive, 
the product will be continuously expressed into the medium and the nutrient medium must be continuously circulated, 
while removing the product of interest and augmenting depleted nutrients. The product may be purified by such tech- 

50 piques as chromatography, e.g., HPLC, affinity chromatography, ion exchange chromatography, etc.; electrophoresis; 
density gradient centrifugation; solvent extraction, or the like. As appropriate, the product may be further purified, as 
required, so as to remove substantially any insect proteins which are also secreted in the medium or result from lysis 
of insect cells, so as to provide a product which is at least substantially free of host debris, e.g., proteins, lipids and 
polysaccharides. 

ss [0126] In order to obtain protein expression, recombinant host cells derived from the transformants are incubated 
under conditions which allow expression of the recombinant protein encoding sequence. These conditions will vary, 
dependent upon the host cell selected. However, the conditions are readily ascertainable to those of ordinary skill in 
the art, based upon what is known in the art. 
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iii. Bacterial Systems 

[0127] Bacterial expression techniques are known in the art. A bacterial promoter is any DNA sequence capable of 
binding bacterial RN A polymerase and initiating the downstream (3") transcription of a coding sequence (e.g. structural 

5 gene) into mRNA. A promoter will have a transcription initiation region which is usually placed proximal to the 5* end 
of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a 
transcription initiation site. A bacterial promoter may also have a second domain called an operator, that may overlap 
an adjacent RNA polymerase binding site at which RNA synthesis begins. The operator permits negative regulated 
(inducible) transcription, as a gene repressor protein may bind the operator and thereby inhibit transcription of a specific 

to gene. Constitutive expression may occur in the absence of negative regulatory elements, such as the operator. In 
addition, positive regulation may be achieved by a gene activator protein binding sequence, which, if present is usually 
proximal (5') to the RNA polymerase binding sequence. An example of a gene activator protein is the catabolite activator 
protein (CAP), which helps initiate transcription of the lac operon in Escherichia coli (E. coli) [Raibaud et al. (1984) 
Annu. Rev. Genet. 18 : 1 73]. Regulated expression may therefore be either positive or negative, thereby either enhancing 

is or reducing transcription. 

[01 28] Sequences encoding metabolic pathway enzymes provide particularly useful promoter sequences. Examples 
include promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose (lac) [Chang et al. 
O 977 ) Nature 198 :10561, and maltose. Additional examples include promoter sequences derived from biosynthetic 
enzymes such as tryptophan (trg) [Goeddel et al. (1 980) Nuc. Acids Res. 8 :4057: Yelverton et al. (1 981 ) Nucl. Acids 

20 Res. 9:731 ; U.S. Patent No. 4,738,921 ; EPO Publ. Nos. 036 776 and 1 21 775]. The g-laotamase (bla) promoter system 
[Weissmann (1981) "The cloning of interferon and other mistakes." In Interferon 3 (ed. I. Gresser)], bacteriophage 
lambda PL [Shimatakeetal. (1 981) Nature 292 : 1 28] and T5 [U.S. Patent No. 4,689,406] promoter systems also provide 
useful promoter sequences. 

[01 29] In addition, synthetic promoters which do not occur in nature alsof unction as bacterial promoters. For example, 

25 transcription activation sequences of one bacterial or bacteriophage promoter may be joined with the operon sequences 
of another bacterial or bacteriophage promoter, creating a synthetic hybrid promoter [U.S. Patent No. 4,551,433]. For 
example, the tac promoter is a hybrid trp-lac promoter comprised of both trg promoter and lac operon sequences that 
is regulated by the tec repressor [Amann et aj. (1 983) Gene 25 : 1 67; de Boer et al. (1 983) Proc. Natl. Acad. Sci. 80 : 
21]. Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin that have the 

30 ability to bind bacterial RNA polymerase and initiate transcription . A naturally occurring promoter of non-bacterial origin 
can also be coupled with a compatible RNA polymerase to produce high levels of expression of some genes in prokary- 
otes. The bacteriophase T7 RNA polymerase/promoter system is an example of a coupled promoter system [Studier 
et al. (1986) J. Mot. Biol. 189 :113; Tabor et al. (1985) Proc Natl. Acad. Sci. 82:1074]. In addition, a hybrid promoter 
can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. No. 267 851). 

35 [0130] In addition to a functioning promoter sequence, an efficient ribosome binding site is also useful for the ex- 
pression of foreign genes in prokaryotes. In E. coli, the ribosome binding site is called the Shine-Dalgamo (SD) se- 
quence and includes an initiation codon (ATG) and a sequence 3-9 nucleotides in length located 3-11 nucleotides 
upstream of the initiation codon [Shine et al. (1 975) Nature 254 :341 The SD sequence is thought to promote binding 
of mRNA to the ribosome by the pairing of bases between the SD sequence and the 3 1 and of E. coli 16S rRNA [Steitz 

40 et aj. (1979) "Genetic signaJs and nucleotide sequences in messenger RNA.' In Biological Regulation and Develop- 
ment: Gene Expression (ed. R.F. Gddberger)]. To express eukaryotic genes and prokaryotic genes with weak ribos- 
ome-binding site [Sambrook et al. (1989) "Expression of cloned genes in Escherichia coli/ In Molecular Cloning: A 
Laboratory Manual]. 

[01 31] A DNA molecule may be expressed intracellular^. A promoter sequence may be directly linked with the DNA 
4£ molecule, in which case the first amino acid at the N -terminus will always be a methionine, which is encoded by the 
ATG start codon. If desired, methionine at the N-terminus may be cleaved from the protein by in vitro incubation with 
cyanogen bromide or by either in vivo on in vitro incubation with a bacterial methionine N-terminal peptidase (EPO 
Publ. No. 219 237). 

[01 32] Fusion proteins provide an alternative to direct expression. Usually, a DNA sequence encoding the N-terminal 
50 portion of an endogenous bacterial protein, or other stable protein, is fused to the 5* end of heterologous coding se- 
quences. Upon expression, this construct will provide a fusion of the two amino acid sequences. For example, the 
bacteriophage lambda cell gene can be linked at the 5* terminus of a foreign gene and expressed in bacteria. The 
resulting fusion protein preferably retains a site for a processing enzyme (factor Xa) to cleave the bacteriophage protein 
from the foreign gene [Nagai et aj. (1984) Nature 309 :81 0]. Fusion proteins can also be made with sequences from 
55 the lacZ [Jia et al. (1987) Gene 60:197], trpE [Allen et al. (1987) J. Biotechnol. 5:93; Makoff et al. (1989) J. Gen. 
Microbiol. 135 : 1 1 ], and Chey [EPO Publ. No. 324 647] genes. The DNA sequence at the junction of the two amino acid 
sequences may or may not encode a cleavable site. Another example is a ubiquitin fusion protein. Such a fusion protein 
is made with the ubiquitin region that preferably retains a site for a processing enzyme (e.g. ubiquitin specific process- 
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ing-protease) to cleave the ubiqurtin from the foreign protein. Through this method, native foreign protein can be isolated 
[Miller etaj. (1989) BkVTechnoloqy 7:6981. 

[0133] Alternatively, foreign proteins can also be secreted from the cell by creating chimeric DNA molecules that 
encode a fusion protein comprised of a signal peptide sequence fragment that provides for secretbn of the foreign 

s protein in bacteria [U.S. Patent No. 4,336,336]. The signal sequence fragment usually encodes a signal peptide com- 
prised of hydrophobic amino acids which direct the secretion of the protein from the cell. The protein is either secreted 
into the growth media (gram-positive bacteria) or into the periplasm ic space, located between the inner and outer 
membrane of the cell (gram-negative bacteria). Preferably there are processing sites, which can be cleaved either jn 
vivo or in vitro encoded between the signal peptide fragment and the foreign gene. 

10 [01 34] DNA encoding suitable signal sequences can be derived from genes for secreted bacterial proteins, such as 
the E. coli outer membrane protein gene (ompA) [Masui et al. (1 983), in: Experimental Manipulation of Gene Expression; 
Ghrayeb et al. (1 984) EM BO J. 3:2437] and the E. coli alkaline phosphatase signal sequence (phoA) [Oka et al. (1985) 
Proc. Natl. Acad. Sci. 82 :721 2]. As an additional example, the signal sequence of the alpha-amylase gene from various 
Bacillus strains can be used to secrete heterologous proteins f rom B. subtilis [Palva et a}. (1 982) Proc. Natl. Acad. Sci. 

is USA 79 :5582; EPO Publ. No. 244 042]. 

[0135] Usually, transcription termination sequences recognized by bacteria are regulatory regions located 3' to the 
translation stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. Transcription termination 
sequences frequently include DNA sequences of about 50 nucleotides capable of forming stem loop structures that 

20 aid in terminating transcription. Examples include transcription termination sequences derived from genes with strong 
promoters, such as the trp gene in E_ coli as well as other biosynthetic genes. 

[0136] Usually, the above described components, comprising a promoter, signal sequence (if desired), coding se- 
quence of interest, and transcription termination sequence, are put together into expression constructs. Expression 
constructs are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable 

2S maintenance in a host, such as bacteria. The replicon will have a replication system, thus allowing it to be maintained 
in a procaryotic host either for expression or for cloning and amplification. In addition, a replicon may be either a high 
or low copy number plasmid. A high copy number plasm id will generally have a copy number ranging from about 5 to 
about 200, and usually about 10 to about 150. A host containing a high copy number plasmid will preferably contain 
at least about 10, and more preferably at least about 20 plasmids. Either a high or low copy number vector may be 

30 selected, depending upon the effect of the vector and the foreign protein on the host. 

[01 37] Alternatively, the expression constructs can be integrated into the bacterial genome with an integrating vector. 
[01 38] I ntegrating vectors usually contain at least one sequence homologous to the bacterial chromosome that allows 
the vector to integrate. Integrations appear to result from recombinations between homologous DNA in the vector and 
the bacterial chromosome. For example, integrating vectors constructed with DNA from various Bacillus strains inte- 

35 grate into the Bacillus chromosome (EPO Publ. No. 127 328). Integrating vectors may also be comprised of bacteri- 
ophage or transposon sequences. 

[0139] Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow 
for the selection of bacterial strains that have been transformed. Selectable markers can be expressed in the bacterial 
host and may include genes which render bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 

40 kanamycin (neomycin), and tetracycline [Davies et al. (1978) Annu. Rev. Microbiol. 32 :469]. Selectable markers may 
also include biosynthetic genes, such as those in the histidine, tryptophan, and leucine biosynthetic pathways. 
[01 40] Alternatively, some of the above described components can be put together in transformation vectors. Trans- 
formation vectors are usually comprised of a selectable market that is either maintained in a replicon or developed into 
an integrating vector, as described above. 

45 [0141] Expression and transformation vectors, either extrachromosomal replicons or integrating vectors, have been 
developed for transformation into many bacteria. For example, expression vectors have been developed for, inter alia, 
the following bacteria: Bacillus subtilis [Palva et al. (1982) Proc. Natl. Acad. Sci. USA 79:5582; EPO Publ. Nos. 036 
259 and 063 953; PCT Publ. No. WO 84/04541], Escherichia coli [Shimatake et al. (1981) Nature 292:128; Amann et 
al. (1985) Gene 40:183; Studieret aj. (1986) J. Mol. Biol. 189 :113; EPO Publ. Nos. 036 776, 136 829 and 136 907], 

so Streptococcus cremoris [Powell et al. (1 988) Ap_pj. Environ. Microbiol. 54:655]; Streptococcus lividans [Powell et al. 
(1988) Appl. Environ. Microbiol. 54:655], Streptomyces lividans [U.S. Patent No. 4,745,056]. 
[0142] Methods of introducing exogenous DNA into bacterial hosts are well-known in the art, and usually include 
either the transformation of bacteria treated with CaClg or other agents, such as divalent cations and DMSO. DNA can 
also be introduced into bacterial cells by electroporation. Transformation procedures usually vary with the bacterial 

ss species tobe transformed. See e.g., [Masson et aL (1989) FEMS Microbiol. Lett . 60:273; Palva et al. (1982) Proc. Natl. 
Acad. Sci. USA 79 :5582; EPO Publ. Nos. 036 259 and 063 953; PCT Publ. No. WO 84/04541 , Bacillus], [Miller et al. 
(1988) Proc. Natl. Acad. Sci. 85 :856; Wang et al. (1990) J. Bacterid. 172 :949, Campylobacter], [Cohen et al. (1973) 
Proc. Natl. Acad. Sci. 69 :2110; Dower etaj. (1988) Nucleic Acids Res. 16 :6127; Kushner(1978) 'An improved method 
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for transformation of Escherichia co!i with ColEI-derived plasmids. In Genetic Engineering: Proceedings of the Inter- 
national Symposium on Genetic Engineering (eds. H.W. Boyer and S. Nicosia); Mandel et aj. (1970) J. Mol. Biol. 53 : 
159; Taketo (1988) Biochim. Biophys. Acta 949 :318; Escherichia], [Chassy et al. (1987) FEMS Microbiol. Lett. 44 :173 
Lactobacillus]; [Fiedler et aj. (1988) Anal. Biochem 170 :38, Pseudomonas]; [Augustin et aj. (1990) FEMS Microbiol. 
5 Lett. 66:203, Staphylococcus], [Barany et aL (1980) J. Bacteriol. 144 :698; Harlander (1987) "Transformation of Strep- 
tococcus lactis by elect roporation, in. Streptococcal Genetics (ed. J. Ferretti and R. Curtiss III); Perry et al. (1981) 
Infec. Immun. 32 :1295; Powell etaj. (1988) Appl. Environ. Microbiol. 54 :655; Somkutietal. (1987) Proc. 4th Evr. Cong- 
Biotechnology 1 :41 2. Streptococcus]. 

10 iv. Yeast Expression 

[0143] Yeast expression systems are also known to one of ordinary skill in the art. A yeast promoter is any DNA 
sequence capable of binding yeast RNA polymerase and initiating the downstream (3') transcription of a coding se- 
quence (e.g. structural gene) into mRN A A promoter will have a transcription initiation region which is usually placed 

15 proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase 
binding site (the "TATA Box") and a transcription initiation site. A yeast promoter may also have a second domain called 
an upstream activator sequence (UAS), which, if present, is usually distal to the structural gene. The UAS permits 
regulated (inducible) expression. Constitutive expression occurs in the absence of a UAS. Regulated expression may 
be either positive or negative, thereby either enhancing or reducing transcription. 

20 [0144] Yeast is a fermenting organism with an active metabolic pathway, therefore sequences encoding enzymes in 
the metabolic pathway provide particularly useful promoter sequences. Examples include alcohol dehydrogenase 
(ADH) (EPO Publ. No. 284044), enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphate- 
dehydrogenase (GAP or GAPDH) , hexokinase, phosphofructokinase, 3-phosphoglycerate mutase, and pyruvate ki- 
nase (PyK) (EPO Publ. No. 329 203). The yeast PHQ5 gene, encoding acid phosphatase, also provides useful promoter 

25 sequences [Myanohara et al. (1 983) Proc. Natl. Acad. Sci. USA 80: 1 ]. 

[0145] In addition, synthetic promoters which do not occur in nature also function as yeast promoters. For example, 
UAS sequences of one yeast promoter may be joined with the transcription activation region of another yeast promoter, 
creating a synthetic hybrid promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked 
to the GAP transcription activation region (U.S. Patent Nos. 4,876,197 and 4,880,734). Other examples of hybrid pro- 

30 moters include promoters which consist of the regulatory sequences of either the ADH2 , GAL4 , GAL10, OR PHQ5 
genes, combined with the transcriptional activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. 
No. 164 556). Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin that have 
the ability to bind yeast RNA polymerase and initiate transcription. Examples of such promoters include, inter alia, 
[Cohen etaj. (1980) Proc. Natl. Acad. Sci. USA 77: 1078; Henikoff etaj. (1981) Nature 283 :835: Hollenberget al. (1981) 

35 Curr. Topics Microbiol. Immunol. 96: 119; Hollenberg et aj. (1979) "The Expression of Bacterial Antibiotic Resistance 
Genes i the Yeast Saccharomyces cerevisiae, - in: Plasmids of Medical Environmental and Commercial Importance 
(eds. K>N> Tlmmis and A. Puhler); Mercerau-Puigalon et al. (1980) Gene 11 :163; Panthier et aj. (1980) Curr. Genet. 
2:109;]. 

[0146] A DNA molecule may be expressed intracellular^ in yeast. A promoter sequence may be directly linked with 
40 the DNA molecule, in which case the first amino acid at the N-terminus of the recombinant protein will always be a 
methionine, which is encoded by the ATG start codon. If desired, methionine at the N-terminus may be cleaved from 
the protein by jn vitro incubation with cyanogen bromide. 

[0147] Fusion proteins provide an alternative for yeast expression systems, as well as in mammalian, baculovirus, 
and bacterial expression systems. Usually, a DNA sequence encoding the N-terminal portion of an endogenous yeast 

45 protein, or other stable protein, is fused to the 5' end of heterologous coding sequences. Upon expression, this construct 
will provide a fusion of the two amino acid sequences. For example, the yeast or human superoxide dismutase (SOD) 
gene, can be linked at the 5' terminus of a foreign gene and expressed in yeast. The DNA sequence at the junction of 
the two amino acid sequences may or may not encode a cleavable site. See e.g., EPO Publ. No. 196 056. Another 
example is a ubiquitin fusion protein. Such a fusion protein is made with the ubiquitin region that preferably retains a 

50 site for a processing enzyme (e.g. ubiquitin-specific processing protease) to cleave the ubiquitin from the foreign protein. 
Through this method, therefore, native foreign protein can be isolated (see, e.g., PCT Publ. No. WO 88/024066). 
[0148] Alternatively, foreign proteins can also be secreted from the cell into the growth media by creating chimeric 
DNA molecules that encode a fusion protein comprised of a leader sequence fragment that provide for secretion in 
yeast of the foreign protein. Preferably, there are processing sites encoded between the leader fragment and the foreign 

55 gene that can be cleaved either in vivo or in vitro. The leader sequence fragment usually encodes a signal peptide 
comprised of hydrophobic amino acids which direct the secretion of the protein from the cell. 

[0149] DNA encoding suitable signal sequences can be derived from genes for secreted yeast proteins, such as the 
yeast invertase gene (EPO Publ. No. 012 873; JPO Publ. No. 62,096,086) and the A-factor gene (U.S. Patent No. 
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4,588,684). Alternatively, leaders of non-yeast origin, such as an interferon leader, exist that also provide for secretion 
in yeast (EPO Publ. No. 060 057). 

[01 50] A preferred class of secretion leaders are those that employ a fragment of the yeast alpha-factor gene, which 
contains both a "pre" signal sequence, and a "pro" region. The types of alpha-factor fragments that can be employed 

5 include the full-length pre-pro alpha factor leader (about 83 amino acid residues) as well as truncated alpha-factor 
leaders (usually about 25 to about 50 amino acid residues) (U.S. Patent Nos. 4,546,083 and 4,870,008; EPO Publ. 
No. 324 274). Additional leaders employing an alpha4actor leader fragment that provides for secretion include hybrid 
alpha-factor leaders made with a presequence of a first yeast, but a pro-region from a second yeast alphafactor. (See 
e.g., PCT Publ. No. WO 89/02463.) 

70 [0151] Usually, transcription termination sequences recognized by yeast are regulatory regions located 3 1 to the 
translation stop codon, and thus together with the promoter flank the coding sequence. These sequences direct the 
transcription of an mRNA which can be translated into the polypeptide encoded by the DNA. Examples of transcription 
terminator sequence and other yeast-recognized termination sequences, such as those coding for glycolytic enzymes. 
[0152] Usually, the above described components, comprising a promoter, leader (if desired), coding sequence of 

f5 interest, and transcription termination sequence, are put together into expression constructs. Expression constructs 
are often maintained in a replicon, such as an extrachromosomal element (e.g., plasmids) capable of stable mainte- 
nance in a host, such as yeast or bacteria. The replicon may have two replication systems, thus allowing it to be 
maintained, for example, in yeast for expression and in a procaryotic host for cloning and amplification. Examples of 
such yeast-bacteria shuttle vectors include YEp24 [Botstein et al. (1979) Gene 8:17-24], pCI/1 [Brake et al. (1984) 

20 Proc. Natl. Acad. Sci USA 81 :4642-46461, and YRp17 fStinchcomb et al. (1982) J. Mol. Biol. 158 :1571. In addition, a 
replicon may be either a high or low copy number plasmid. A high copy number plasmid will generally have a copy 
number ranging from about 5 to about 200, and usually about 10 to about 150. A host containing a high copy number 
plasmid will preferably have at least about 1 0, and more preferably at least about 20. Enter a high or low copy number 
vector may be selected, depending upon the effect of the vector and the foreign protein on the host. See e.g., Brake 

25 et al. , supra. 

[0153] Alternatively, the expression constructs can be integrated into the yeast genome with an integrating vector. 
Integrating vectors usually contain at least one sequence homologous to a yeast chromosome that albws the vector 
to integrate, and preferabty contain two homologous sequences flanking the expression construct. Integrations appear 
to result from recombinations between homologous DNA in the vector and the yeast chromosome [Orr-Weaver et al. 

30 (1 983) Methods in Enzymol. 101 : 228-245]. An integrating vector may be directed to a specific locus in yeast by selecting 
the appropriate homologous sequence for inclusion in the vector. See Orr-Weaver etal., supra . One or more expression 
construct may integrate, possibly affecting levels of recombinant protein produced [Rine et aj. (1 983) Proc. Natl. Acad. 
Sci. USA 80 :67501. The chromosomal sequences included in the vector can occur either as a single segment in the 
vector, which results in the integration of the entire vector, or two segments homologous to adjacent segments in the 

35 chromosome and flanking the expression construct in the vector, which can result in the stable integration of only the 
expression construct. 

[0154] Usually, extrachromosomal and integrating expression constructs may contain selectable markers to allow 
for the selection of yeast strains that have been transformed. Selectable markers may include biosynthetic genes that 
can be expressed in the yeast host, such as ADE2, HIS4, LEU2, TRP1, and ALG7, and the G418 resistance gene, 

40 which confer resistance in yeast cells to tunicamycin and G418, respectively. In addition, a suitable selectable marker 
may also provide yeast with the ability to grow in the presence of toxic compounds, such as metal. For example, the 
presence of CUP1 albws yeast to grow in the presence of copper ions [Butt et aj. (1 987) Microbiol, Rev. 51 :351 ]. 
[01 55] Alternatively, some of the above described components can be put together into transformation vectors. Trans- 
formation vectors are usually comprised of a selectable marker that is either maintained in a replicon or developed into 

45 an integrating vector, as described above. 

[01 56] Expression and transformation vectors, either extrachromosomal replicons or integrating vectors, have been 
developed for transformation into many yeasts. For example, expression vectors have been developed for, inter alia, 
the following yeasts:Candida albicans [Kurtz, et al. (1 986) Mol. Cell. Biol. 6:1421, Candida maltose [Kunze, et al. (1 985) 
J. Basic Microbiol. 25 :1411. Hansenula polymorpha [Gleeson, etal. (1986) J. Gen. Microbiol. 132 :3459; Roggenkamp 

50 et al. (1986) Mol. Gen. Genet . 202:3021, Kluyveromyces fragilis [Das, et al. (1984) J. Bacterid. 158 :11651, Kluyvero- 
myces lactis [De Louvencourt et al. (1983) J. Bacteriol. 154 :737; Van den Berg et al. (1990) Bio/Technology 8:1351, 
Pichia guillerimondii [Kunze etal. (1985) J. Basic Microbiol. 25: 141], Pichia pastoris [Cregg, et aL (1985) Mol. Cell. 
Biol. 5:3376; U.S. Patent Nos. 4,837,148 and 4,929,555], Saccharomyces cerevisiae [Hinnen et aj. (1978) Proc. Natl. 
Acad. Sci. USA 75: 1 929; Ito et aj. (1 983) J. Bacteriol. 153 : 1 63], Schizosaccharomyces pombe [Beach and Nurse (1 981 ) 

55 Nature 300 :7061, and Yarrowia lipolytica [Davidow, et al. (1985) Curr. Genet. 10:380471 Gaillardin, et aj. (1985) Curr. 
Genet. 10:491. 

[0157] Methods of introducing exogenous DNA into yeast hosts are well-known in the art, and usually include either 
the transformation of spheroplasts or of intact yeast cells treated with alkali cations. Transformation procedures usually 
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vary with the yeast species to be transformed. See e.g. , [Kurtz et aj. (1 986) Mol. Cell. Biol. 6: 1 42; Kunze etal . (1 985) 
J. Basic Microbiol. 25 :141: Candida]; [Gleeson et al. (1986) J. Gen. Microbiol. 132 :3459; Ftoggenkamp et aL (1986) 
Mol. Gen. Genet. 202 :302; Hansenula]; [Das et al. (1984) J. Bacterid. 158 :1165; De Louvencourt et al. (1983) J, 
Bacterid. 154 :1165: Van den Berg et al. (1 990) Bio/Technoloqv 8: 1 35: Kluyveromyces]; [Cregg et al. (1985) Mol. Cell. 
5 Biol. 5:3376; Kunze etal (1 985) J. Basic Microbiol. 25 :141 ; U.S. Patent Nos. 4,837,148 and 4,929,555; Pichia]; [Hinnen 
et al. (1978) Proc. Natl. Acad. Sci. USA 75 ; 1929; Ito et al. (1983) J. Bacterid. 153 :163 Saccharomyces]; [Beach and 
Nurse (1981) Nature 300 : 706; Schizosaccharomyces]; [Davidowetal. (1985) Curr. Genet. 10:39; Gai Hardin etal. (1985) 
Curr. Genet. 10:49; Yarrowia]. 

10 Example 1 - Detoxified LT 

[0158] A fragment of the gene for LT was extracted from plasmid EWD299 [Dallas W.S., Gill D M. and Fatkow S., 
1979, J. Bacterid., 139, 850^858] by digestion with the restriction enzymes Smal and EcoRI, and was recloned in the 
vector Bluescript KS suitable for producing single strands of DNA [Sambrook J., Fritsch E. and Maniatis, T. "Molecular 
is Cloning", Cold Spring Harbor]. 

[0159] BW313 cells were transformed by the clones thus obtained and allowed to grow for 14 hours in a culture 
medium consisting of Luria Broth with the addition of 1 jig/ml of uridine. 

[0160] A series of synthetic oligonucleotides (listed in Table 1 below), containing the mutation, or the desired bases 
instead of the natural ones, and a sequence of 10 bases upstream and 10 downstream of the same mutation, identical 
20 to the natural ones, was first of all synthesised chemically and then phosphorylated, 1 .5 pmol thereof being treated at 
37° C with 5 units of kinase. 

[0161] After halting the reaction with a 100 mM EDTA solution, the oligonucleotides were annealed to the single 
strand containing the LT gene, by heating for 5 minutes at 70°C and cooling slowty for about one hour in ice. 
[0162] At that stage there was added to this cold solution (25 uJ) a solution of free nucleotides, the enzyme DNA 
25 ligase and the enzyme DNA polymerase, in a final volume of 100 uJ. 

[0163] The solution thus obtained was kept for five minutes in ice, five minutes at ambient temperature and two hours 
at 37°C. 

[0164] Suitable cells of E. coli were transformed with the reaction mixture, in accordance with the usual techniques 
[Sambrook J., Fritsch E. and Maniatis T "Molecular Cloning" Cold Spring Harbor], and the site-directed mutagenesis 
30 was checked by sequencing of the clones obtained. 

[01 65] The Smal-EcoRI fragment containing the various mutations was substituted for the original Smal-EcoRI insert 
in the plasmid EWD299. 

[0166] The strains which encode the mutated toxin were then grown in 10 ml of Luria Broth for 12 hours at 37°C. 
[0167] The cultures were centrrfuged and the precipitate containing the cells was resuspended in 300 ml of a solution 
35 containing 25% of sucrose and 50 mM of Tris buffer at pH8, and the mixture was treated for one hour at ambient 
temperature with 1 mg/ml of a solution of Polymixin B. 

[0168] The presence of the toxoid in the periplasmatic supernatant liquor was verified by means of Western Blot and 
its toxicity was evaluated by the inducement or lack of inducement of morphological changes in Y1 cells (see Table 1 ). 
[0169] Y1 cells are adrenal tumour epithelial cells which become markedly more rounded when treated with a solution 

40 containing CT or LT [Yasamure Y, Buonassisi V. and Sato G., "Clonal analysis of differentiated function in animal cell 
cultures", Cancer Res., 1966, 26, 529-535]. The toxicity of CT and LT is correlated with this morphological transition. 
The periplasms supernatant is diluted with a solution of F10 medium, horse serum 1.5%, glutamine and gentamycin 
to lesser and lesser concentrations and Y1 cells (250000 cells/ml) are incubated with the resulting solutions for 48 
hours at 37°C under an atmosphere of C0 2 . The morphology of the cells is evaluated. 

45 [0170] In all cases, immunogenic^ was shown by correct assembly of the complete toxoid and by cross reaction of 
the toxoid with antibody to the wild type LT. 
[0171] The results are shown in Table I below. 

[0172] In this Table (and in Table II below) the toxicity symbols mean as follows: 
so +++ toxic after dilution 1 :2000 (wild type toxicity) 

++ toxic up to dilution 1 :250 
+ toxic up to dilution 1 :64 

55 

- not toxic, even undiluted 
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TABLE I 

Example Mutation Oligonucleotide Sequence Toxicity 

5 

1.1 LT Val-53-Asp 291 - ACCGGCTTTGATAGATATGAT- 3 1 1 

1.2 LT Val-53-Glu 291 - ACCGGCTTTGAAAGATATGAT- 3 1 1 

1.3 LT Val-53-Tyr 291-ACCGGCTTTTACAGATATGAT-311 

1.4 LT Ser-63-Lys 322 -GTTTCCACTAAGCTTAGTTTG~ 3 4 2 
10 1.5 LT Val-97-Lys 424 -ATGTTTAATAAGAATGATGTA-4 4 4 

1.6 LT Val-97-Tyr 424 -ATGTTTAATTACAATGATGTA-4 4 4 

1.8 LT His-107-Glu 454 -TACAGCCCTGAGCCATATGAA-4 7 4 -»/- 

1.9 LT Tyr-104-Lys 445 - ATTAGCGT AAAG AGCCCT- 462 

1.10 LT Tyr-104-Asp 445 - ATTAGCGTAGATAGCCCT- 462 
is l.ll LT Tyr-104-Ser 447 -TAGCGTAAGTAGCCCTCA- 464 

1.12 LT Pro-106-Ser 453 - ATACAGCAGCCACCCATA- 470 

Two mutation of serine ( Ser- 1 14 -Glu : 4 77 -GGAGGTGAAGCGTTAGG- 
494 and Ser - 1 1 4 -Ly s : 4 7 7 -GG AGGTTAAAGCGTTAGG-4 94) were also 
20 shown to exhibit substantially reduced toxicity. 



Comparative Examples 



35 



A 


LT 


LT Wild Type 






4fh 


B 


LT 


Arg- 


210-Asp 


769 


-ATATATCTCAACGAATATCAA-7 8 9 


+ 


C 


LT 


Leu- 


41-Phe 


113 


- ATATTAATTTCTATG ATC- 130 


NA 


D 


LT 


His- 


44-Phe 


121 


-CTTTATGATTTTGCGAGA- 138 


KA 


E 


LT 


Ala- 


45-Tyr 


125 


-ATGATCACTATAGAGGAA- 142 


NA 


F 


LT 


Arg- 


54-Ala 


152 


-GCTTTGTCGCGTATGATG- 169 


■hf 


G 


LT 


Arg- 


54-Lys 


151 


-GGCTTTGTCAAGTATGATGAT-17 1 


++ 


H 


LT 


Tyr- 


5 9 -Met 


167 


-ATG ACGGAATGGTTTCCA- 18 4 


-Hf 


I 


LT 


Val- 


60-Gly 


169 


-GACGGATATGGATCCACTTCT- 18 9 


NA 


J 


LT 


Ser- 


68-Lys 


193 


-AGTTTGAGAAAGGCTCACTTA-2 1 3 


++ 


K 


LT 


Ser- 


6 8 -Pro 


193 


- AGTTTG AG ACCAG CTCACTTA- 2 1 3 


NA 


L 


LT 


His- 


•70-Pro 


199 


-AGAAGTGCTCCTTTAGCAGGA- 219 


NA 


M 


LT 


Ala- 


72-Arg 


205 


-GCTCACTTAAGGGGACAGTCT- 225 


++ 


N 


LT 


Ala- 


72-His 


205 


-GCTCACTTACATGGACAGTCT- 225 


-w- 


0 


LT 


Arg- 


192-Asn 


565 


-GATTCATCAATTACAATCACA-58 5 


4H~ 



40 

(NA means "not assembled", i.e the holotoxin AB 5 is not 
45 formed at all) 



Example 2 - Detoxified CT 

so [0173] The procedure followed in the case of the gene for the toxin CT is analogous to that described above. 

[0174] A fragment containing the gene for a CT was amplified by means of the polymerase chain reaction (PCR) 
technique from plasmid pCT322. An alternative and equivalent source of the CT gene is plasmid pJM17 (Pearson et 
al, PNAS USA, 79, (1982), 2976-2980). 
[0175] The following two synthetic primers were used: 



1 ) GG C AG A TT CTAG A C CTCCTG ATG AAAT AAA 
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2 ) TGAAGTTTGGCGAMCTICTTAATTTGCCATACTAATTGCGGC^ATCGCAT 



s [0176] containing respectively an Xbal site and an artificial Hindi 1 1 site (shown underlined). 

[0177] The resulting amplified fragment, Xbal-Hindlll, which has a length of 1074 base pairs, contains the codons 
of the two sub-units, A and B, but not the sequence encoding the leader peptide of the A sub-unit. This fragment was 
recloned in Bluescript KS vector and was treated in accordance with the procedure described above for LT, so as to 
effect the site-directed mutagenesis. 



TABLE II 

15 Example Mutation Oligonucleotide Sequence Toxicity 

j 

2.1 CTVal-53-Asp ACGGGATTTGACAGGCACGAT 

2.2 CTSer-63-Lys GTTTCCACCAAGATTAGTTTG 

2.3 CTVal-97-Lys ATGTTTAACAAGAATGATGTA 
20 2.4 CTSer-106-Pro GGCATACAGTAGCCATCCAGA 

Comparative Examples 



25 



A CT Arg-192-Asn GAATG CTCCAAACTCATCGAT +++ 
B CTArg-54-His GG ATTTGTTCATC A CGATG AT ++ 



[0178] The following mutations also proved to abolish toxicity: His-107-Asn (TACAGTCCTAACCCAGATGAA) , Glu- 
110-Ser (TCATCCAGATTCGCAAGAAGT), Glu-112-Ala (CAGATGAACAAGCTGTTTCTG) and Ser-114-Glu (CAA- 
30 G AAGTTG AAGCTTTAGGT). 

[0179] It will be understood that the invention is described above by way of example only and modifications of detail 
may be made within the scope and spirit of the invention. 

35 Claims 

1. An immunogenic detoxified protein comprising the amino acid sequence of subunit A of a cholera toxin (CT-A) or 
a fragment thereof or the amino acid sequence of subunit A of an Escherichia coli heat labile toxin (LT-A) or a 
fragment thereof wherein the amino acid at, or in the position corresponding to Ser-63 is replaced with another 

40 amino acid. 

2. An immunogenic detoxified protein according to claim 1 comprising replacement Ser-63-Lys. 

3. An immunogenic detoxified protein according to claim 1 or claim 2 wherein additionally one or more amino acids 
45 at, or in positions corresponding to Arg-7, Asp-9, Arg-1 1 , His-44, val-53, Arg-54, Ser-61 , His-70, Val-97, Tyr-1 04, 

Pro-106, His-107, Glu-110, Glu-112, Ser-114, Trp-127, Arg-146 or Arg-192 are replaced. 

4. An immunogenic detoxified protein according to claim 3 comprising one or more of the following amino acid re- 
placements: Val-53-Asp, val-53-Glu, val-53-Tyr, val-97-Lys, Val-97-Tyr, His-107-Glu, Tyr-104-Lys, Tyr-104-Asp, 

so Tyr-1 04-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

5. An immunogenic composition for use a vaccine comprising an immunogenic detoxified protein according to any 
one of the preceding claims and a pharmaceutically acceptable carrier. 

55 6. A vaccine composition comprising an immunogenic detoxified protein according to any one of claims 1 to 4 and a 
pharmaceutically acceptable carrier. 

7. A vaccine composition according to claim 6 further comprising an adjuvant. 
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8. A DNA sequence encoding an immunogenic detoxified protein according to any one of claims 1 to 4. 

9. A vector carrying a DNA according to claim 8. 

s 10. A host cell line transformed with the vector according to claim 9. 

11 . A process for the production of an immunogenic detoxified protein according to any one of claims 1 to 4 comprising 
culturing a host cell according to claim 10. 

10 1 2. A process for the production of a DNA according to claim 8 comprising the steps of subjecting a DNA encoding a 
CT-A or an LT-A or a fragment thereof to site-directed mutagenesis. 

13. A process for the formulation of a vaccine according to claim 6 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 4 into association with a pharmaceutically acceptable carrier 

15 

14. A process for the formulation of a vaccine according to claim 7 comprising bringing an immunogenic detoxified 
protein according to any one of claims 1 to 4 into association with an adjuvant. 

15. An immunogenic detoxified protein according to any one of claims 1 to 4 for use as a pharmaceutical. 

20 

16. An immunogenic detoxified protein according to any one of claims 1 to 4 for use as a vaccine. 

17. The use of an immunogenic detoxified protein according to any one of claims 1 to 4 for the preparation of a med- 
icament for vaccinating a mammal against Vibrio cholerae or an enterotoxigenic strain of Escherichia coil 

25 

18. A composition according to claim 5 or claim 6, in a form suitable for transdermal administration. 

19. An immunogenic detoxified protein according to claim 15 or claim 16, wherein said use is for transdermal admin- 
istration. 

30 

Patentanspruche 

Immunogenes entgiftetes Protein, das die Aminosauresequenz der Untereinheit A eines Choleratoxins (CT-A) 
oder eines Fragments davon oder die Aminosauresequenz der Untereinheit A eines hitzelabilen Toxins (LT-A) von 
Escherichia coli oder eines Fragments davon umfaBt, in dem die Aminosaure an der Position Ser-63 oder an einer 
Ser-63 entsprechenden Position durch eine andere Aminosaure ersetzt ist. 

Immunogenes entgiftetes Protein nach Anspruch 1 , das den Austausch Ser-63-Lys umfaBt. 

immunogenes entgiftetes Protein nach Anspruch 1 oder 2, in dem zusatzlich eine oder mehrere Aminosauren an 
den Positionen Arg-7, Asp-9, Arg-11, His-44, Val-53, Arg-54, Ser-61, His-70, Val-97, Tyr-104, Pro-106, His-107, 
Glu-110, Glu-112, Ser-114, Trp-127, Arg-146 oder Arg-1 92 oder an diesen Positionen entsprechenden Positionen 
ersetzt sind. 

Immunogenes entgiftetes Protein nach Anspruch 3, das einen oder mehrere der folgenden Aminosaureaustausche 
umfaBt: Val-53-Asp, VaI-53-Glu, Val-53-Tyr, Val-97-Lys, val-97-Tyr, His-107-Glu, Tyr-104-Lys, Tyr-104-Asp, Tyr- 
104-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

Immunogene Zusammensetzung zur Verwendung als Impfstoff, die ein immunogenes entgiftetes Protein nach 
einem der vorhergehenden Anspruche und einen pharmazeutisch vertraglichen Trager umfaBt. 

Impfstoff zusammensetzung, die ein immunogenes entgiftetes Protein nach einem der Anspruche 1 bis 4 und einen 
pharmazeutisch vertraglichen Trager umfaBt. 

Impfstoffzusammensetzung nach Anspruch 6, die weiterhin ein Adjuvans umfaBt. 
DNA-Sequenz, die ein immunogenes entgiftetes Protein nach einem der Anspruche 1 bis 4 codiert. 



1. 

35 



2. 

40 

3. 



45 

4. 



so 5. 



6. 

55 

7. 
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9. Vektor, der eine DNA nach Anspruch 8 tragt. 

10. Wirtszellinie, die mit dem Vektor nach Anspruch 9 transformiert ist. 

s 11. Verfahren zur Herstellung eines immunogenen entgifteten Proteins nach einem der AnsprOche 1 bis 4, das die 
ZOchtung einer Wirtszelle nach Anspruch 1 0 umfaBt. 

12. Verfahren zur Herstellung einer DNA nach Anspruch 8, wobei man mit einer ein CT-A Oder ein LT-A oder ein 
Fragment davon codierenden DNA eine ortsspezifische Mutagenese durchfuhrt. 

10 

13. Verfahren zur Formulierung eines Impfstoffs nach Anspruch 6, wobei man ein immunogenes entgiftetes Protein 
nach einem der AnsprOche 1 bis 4 mit einem pharmazeutisch vertraglichen Trager zusammenbringt. 

14. Verfahren zur Formulierung eines Impfstoffs nach Anspruch 7, wobei man ein immunogenes entgiftetes Protein 
*5 nach einem der AnsprOche 1 bis 4 mit einem Adjuvans zusammenbringt 

15. Immunogenes entgiftetes Protein nach einem der AnsprOche 1 bis 4 zur verwendung als pharmazeutisch wirksame 
Verbindung. 

20 16. Immunogenes entgiftetes Protein nach einem der AnsprOche 1 bis 4 zur Verwendung als Impfstoff. 

17. Verwendung eines immunogenen entgifteten Proteins nach einem der AnsprOche 1 bis 4 zur Herstellung eines 
Arzneimittels zur Impfung eines Saugers gegen Vibrio cholerae oder einen enterotoxigenen Stamm von Escheri- 
chia coli. 

25 

18. Zusammensetzung nach Anspruch 5 oder 6 in einer fur die transdermal verabreichung geeigneten Form. 

19. Immunogenes entgiftetes Protein nach Anspruch 15 oder 16, wobei die verwendung fOr die transdermal Verab- 
reichung ist. 

30 

Revendlcations 

1. Proteine detoxifiee immunogene comprenant la sequence d'acides amines de la sous-unite A d'une toxine de 
35 cholera (CT-A) ou un de ses fragments ou la sequence d'acides amines de la sous-unite A d'une toxine thermolabile 

d'Escherichia coli (LT-A) ou cfun de ses fragments ou I'acide amine a, ou dans la position correspondant a, Ser- 
63 est remplace par un autre acide amine. 

2. Proteine detoxifiee immunogene sekxi la revendication 1 comprenant le remplacement Ser-63-Lys. 

40 

3. Proceine detoxifiee immunogene selon la revendication 1 ou la revendication 2 dans laquelle en outre un ou 
plusieurs acides amines en, ou dans les positions correspondant a Arg-7, Asp-9, Arg-11 , His-44, Val-53, Arg-54, 
Ser-61, His-70, Val-97, Tyr-104, Pro-106, His-107, Glu-110, Glu-112, Ser-114, Trp-127, Arg-146 ou Arg-192 sont 
remplaces. 

45 

4. Proteine deloxifiee immunogene selon la revendication 3, comprenant un ou plusieurs des remplacements d'acides 
amines suivants: Val-53-Asp, val-53-Glu, val-53-Tyr, Val-97-Lys, Val-97-Tyr, His-107-Glu, Tyr-104-Lys, Tyr- 
104-Asp, Tyr-104-Ser, Pro-106-Ser, Ser-114-Glu, Ser-114-Lys. 

50 5. Composition immunogene pour utilisation comme vaccin comprenant une proteine detoxifiee immunogene selon 
Tune quelconque des revendications precedentes et un support pharmaceutiquement acceptable. 

6. Composition de vaccin comprenant une proteine detoxifiee immunogene selon Tune quelconque des revendica- 
tions 1 a 4 et un support pharmaceutiquement acceptable. 

55 

7. Composition de vaccin sekxi la revendication 6, comprenant en outre un adjuvant. 

8. Sequence d'ADN codant pour une proteine detoxifiee immunogene sebn Tune quelconque des revendications 1 
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a 4. 

9. Vecteur portant un ADN selon ta revendication 8. 

s 10. Lignee cellulaire note transformee avec le vecteur selon la revendication 9. 

11. Precede de production d'une proteine detoxifiee immunogene selon I'une quelconque des r even dicat ions 1 a 4 
dans lequel on cultive une cellule hdte selon la revendication 10. 

10 12. Procede de production d'un ADN sebn la revendication 8 comprenant les Stapes de soumission cfun ADN codant 
pour une CTA ou une LTA ou un de leurs fragments, a une mutagenese dirigee. 

13. Procede de formulation d'un vaccin selon la revendication 6 dans lequel on amene une proteine detoxifiee immu- 
nogene selon I'une quelconque des revendications 1 a 4 en association avec un support pharmaceutiquement 

is acceptable. 

14. Procede de formulation o°un vaccin selon la revendication 7, dans lequel on amene une proteine detoxifiee immu- 
nogene selon Tune quelconque des revendications 1 a 4 en association avec un adjuvant. 

20 15. Proteine detoxifiee immunogene selon I'une quelconque des revendications 1 a 4 pour utilisation comme agent 
pharmaceutique. 

16. Proteine detoxifiee immunogene selon I'une quelconque des revendications 1 a 4 pour utilisation comme vaccin. 

25 17. Utilisation d'une proteine detoxifiee immunogene selon I'une quelconque des revendications 1 a 4 pour la prepa- 
ration d'un medicament visant a vacciner un mammifere contre Vibrio cholerae ou une souche enterotoxig6ne 
d'Escherichia coil 

18. Composition selon la revendication 5 ou la revendication 6 sous une forme appropriee a I'administration transder- 
30 mique. 

19. Proteine deloxiftee immunogene selon la revendication 15 ou la revendication 16 dans laquelle ladite utilisatbn 
est pour I'administration transdermique. 

35 



40 



45 



50 



55 
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LT2 1 ---.FF T R-A L QQ-AYE PI 38 

LT1 1 FRS . 39 

LT1_1A 1 -G-R -R HN 40 

CT 1 NDDKLYRADSRPPDEIKQSGGLMPRGQSEYFDRGTQMNIN 40 



— E V — NT — N TVT — Q 1 — N — GS- 78 

Y 79 

Y L A — S Y 80 

LYDHARGTQTGFVRHDDGYVSTSISLRSAHLVGQTILSGH 80 



NE V-P L-D — G R Y-S-N-FA 118 

-LTIYI ... IS 116 

_ v ~Y— 120 

STYYI YVIATAPNMFNVNDVLGAYSPHPDEQEVSALGGIP 120 



L 1 SF-A-EGGMQ D — GDLF-G-TV — N — 158 

156 

N 1 — R E R — N -E- 160 

Y SQ I YGWYRVHFG VLDEQLHRNRG YRDR Y Y SNLD I APAAD 160 



— Q SNFP M — STF — EQ-VPNNKEFK-GV-I 198 

196 

— R D-Q Q DSS-TITGD — N 200 

GYGLAGFPPEHRAWREEPWIHHAPPGCGNAPRSSMSNTCD 200 



SA-NV — KYD-MNFKKLL — RLALTFFM — D-F-GVHGE 241 

. . 236 

-E—N-STIY-R D- . — EV- . IY . R 240 

EKTQSLGVKFLDEYQSKVKRQI FSG Y . QSDID . THNRI . KDEL 240 



Figure 1 
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LT AATGGCGACAGATTATACCGTGCTGACTCTAGACCCCCAGATGAAATAAAACGTTTCCGG 

NGDRLYRADSRPPDEIKRFR 20 



NDDKLYRADSRPPDE I KQSG 20 
CT AATGATGATAAGTTATATCGGGCAGATTCTAGACCTCCTGATGAAATAAAGCAGTCAGGT 

LT AGTCTTATGCCCAGAGGT . . • AATGAGTACTTCGATAGAGGAACTCAAATGAAIATTAAT 

SLMPRGQNEYFDRGTQMNIN 39 

GLMPRGQSEYFDRGTQMNIN 40 
CT GGTCTTATGC CAAGAGG ACAG AGTG AGTACTTTG A CCGAGGTACTCAAATGAATATCAAC 

LT CTTTATGATCACGCGAGAGGAACACAAACCGGCTTTGTCAGATATGATGACGGATATGTT 

LYDHARGTQTGFVRYDDGYV 59 



LYDHARGTQTGFVRHDDGYV 60 
CT CTTTATG ATCATGCAAG AGG AACTCAG ACGGG ATTTGTTAGGCACGATGATGG ATATGTT 

LT TCCACTTCTCTrAGTTTGAGAAGTGCTCACTTAGCAGGACAGTATATATTATCAGGATAT 

STSLSLRSAHLAGQY ILSGY 79 



STSIS LRSAHLVGQTILSGH 80 
CT TCCACCTCAATTAGTTTGAGAAGTGCCCACTTAGTGGGTCAAACTATATTGTCTGGTCAT 

LT TCACTTACTATAT AT ATCGTT ATAG CA AATATGTTTAATGTTAATG ATGTA 

SLTIYIVIA NMFNVNDV 96 



STYY I YVI ATAPvNMFNVNDV 100 
CT TCTACTTATTATATATATGTTATAGCCACTGCACCCAACATGTTTAACGTTAATGATGTA 

LT ATTAGCGTATACAGCCCTCACCCATATGAACAGGAGGTTTCTGCGTTAGGTGGAATACCA 

ISVYSPHPYEQEVSALGGIP 116 



LG AYSPHP DEQEVSALGGIP 120 
CT TTAGGGGCATACAGTCCTCATCCAGATGAACAAGAAGTTTCTGCTTTAGGTGGGATTCCA 



Figure 2a 
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TATTCTCAGATATAIGGATGGTATCGTGTTAATTTTGGTGTGAXTGATGAACGATTACAT 
YSQIYGWYRVNFGVIDERLH 136 



YSQI YGWYRVHFGVLDEQLH 140 
TACTCCCAAATAXATGGATGGTATCGAGTTCATTTTGGGGTGCTTGAXGAACAATTACAT 

CGTAACAGGGAATATAGAGACCGGTATTACAGAAATCTGAATATAGCTCCGGCAGAGGAT 
RNREYRDRYYRNLNIAPABD 156 



RNRGYRDRYYSNLDIAPAAD 160 
CGTAATAGGGGCTACAGAGATAGATATTACAGTAACTTAGATAXTGCTCCAGCAGCAGAT 

GGTTACAGATTAGCAGGTTTCCCACCGGATCACCAAGCTTGGAGAGAAGAACCCTGGATT 
GYRLAGFPPDHQAWREEPWI 176 



GYG LAGFPPEHRAWREEPWI 180 
GGTTATGGATTGGCAGGTTTCCCTCCGGAGCATAGAGCTTGGAGGGAAGAGCCGTGGATT 

CATCATGCACCACAAGGTTGXGGAGATTCATCAAGAACAATCACAGGTGATACTTGTAAT 
HHAPQGCGDSSRTITGDTC N 196 



HHAPPGCG NAPRSSISNTCD 200 
CATCAXGCACCGCCGGGTTGTGGGAATGCTCCAAGATCATCGATCAGTAAXACTTGCGAT 

GAGGAGACCCAGAATCTGAGCACAATATATCTCAGGGAATATCAATCAAAAGTTAAGAGG 
EEXQKLSTIYLREYQSKVKR 216 



E K TQSLGVKFLDEYQSKVKR 220 
GAAAAAACCCAAAGTCXAGGTGTAAAATTCCTTGACGAATACGAAXCTAAAGXXAAAAGA 

CAGAXATTTTCAGACTATCAGTCAGAGGTXGACATATATAACAGAATTCGGGATGAATTATGA 
QIFSDYQ5EVDIYNRIRDEL * 



QIFSGYQSDIDTHNRIKDEL. * 
CAAATATTTTCAGGCTATCAATCTGATATTGATACACATAATAGAATTAAGGATGAATTATGA 



Figure 2b 
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